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Electronic communication records provide detailed information about temporal aspects of human 
interaction. Previous studies have shown that individuals' communication patterns have complex 
temporal structure, and that this structure has system-wide effects. In this paper we use mobile 
phone records to show that interaction patterns involving multiple individuals have non-trivial tem- 
poral structure that cannot be deduced from a network presentation where only interaction frequen- 
cies are taken into account. We apply a recently introduced method, temporal motifs, to identify 
interaction patterns in a temporal network where nodes have additional attributes such as gender 
and age. We then develop a null model that allows identifying differences between various types 
of nodes so that these differences are independent of the network based on interaction frequencies. 
We find gender-related differences in communication patters, and show the existence of temporal 
homophily, the tendency of similar individuals to participate in interaction patterns beyond what 
would be expected on the basis of the network structure alone. We also show that temporal patterns 
differ between dense and sparse parts of the network. Because this result is independent of edge 
weights, it can be considered as an extension of Granovetter's hypothesis to temporal networks. 



Traditional methods of collecting data on human in- 
teractions are limited to small samples and have poor 
temporal resolution [T] , and research on dynamic aspects 
has been limited to large scale structural changes [2]. 
Availability of detailed electronic communication data 
has opened unprecedented opportunities, revolutionizing 
tools and scope of network studies H] . This has lead to 
better understanding of human interaction networks at 
the societal scale ^ as well as of their mesoscopic struc- 
ture [6j. Records of mobile phone calls, emails, tweets, 
and messages sent in social networking sites enable the 
investigation of communication behavior with fine time 
scale, and has already revealed the peculiar, bursty qual- 
ity of human communication [7HS] . Interactions in vari- 
ous other complex systems also display rich time-domain 
behavior [10] . 

Complex networks has become a common approach for 
analyzing large social systems pTl - fTS] . Social networks 
can be constructed for example by aggregating commu- 
nication records in time. In such aggregate networks the 
nodes correspond to people and edges denote their re- 
lations as inferred from the communication records; in 
weighted aggregate networks edge weights are used to 
denote communication frequency [5]. There are, how- 
ever, phenomena that cannot be approached from this 
static network point of view |14l 115) . For example, even 
individuals who are highly connected in the aggregate 
network may only interact with a small number of ac- 
quaintances at a time |16| . decreasing their importance 
as information-spreading hubs. The functional subunits 
of a system might not manifest in the existence of con- 
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nections, but rather in the temporal structure of interac- 
tions. Consider for example members of a social group 
exchanging messages, brain areas activating in concert 
in response to stimuli, or software modules calling one 
another in a specific sequence. 

Increased awareness about the importance of tempo- 
ral information has led to the emergence of temporal net- 
works [lili , a framework that has been used to study such 
diverse systems as information flow in human communi- 
cation [HI [TS] , interactions of ants [T^ , spreading of sex- 
ually transmitted diseases [18] , characteristic patterns of 
face-to- face interactions [THl HO], and transportation of 
livestock [5T]. In this article we study the meso-scale 
structure of temporal networks by using temporal motifs 
|22j . The temporal networks we study are colored: there 
are multiple types of nodes and events, each type distin- 
guished by a different color. In the social context node 
types can refer to individuals' attributes such age and 
gender. Our goal is to identify differences in the relative 
frequency of temporal motifs between different node and 
event types; furthermore, we want the identified differ- 
ences to be independent of the structure of the weighted 
aggregate network. 

Temporal motifs are analogous to network motifs origi- 
nally introduced by Milo et al. in 2002 [25ti25] . Network 
motifs were defined as classes of isomorphic subgraphs 
that are more common in the empirical network that in 
some null model, the most commonly used null model 
being the configuration model, a random network that 
retains the degrees of all nodes [26] . The use of a random 
network to define motifs has however proven problematic 
[?7] and we therefore adopt the usage of Onnela et al. 
[28] and use the term "motif" more generally to denote 
a class of subgraphs, independent of their statistical sig- 
nificance in comparison to some reference. The problem 
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of constructing a suitable null model is central in motif 
analysis, and the development of a null model to use with 
colored temporal networks is one central contribution of 
this article. 

Using a large data set of mobile phone call records 
we find that node attributes have a significant effect on 
the occurrence of temporal motifs. We identify temporal 
homophily, over-representation of temporal patterns that 
contain similar nodes beyond that predicted by the struc- 
ture of the aggregate network. We also find consistent 
and robust differences between events occurring in dense 
and sparse parts of the aggregate network. Because our 
results are independent of the structure of the weighted 
aggregate network, this can be seen as a temporal exten- 
sion of Granovetter's hypothesis about the correlation of 
local density and edge weights ^29) . 



I. TEMPORAL MOTIFS IN COLORED 
NETWORKS 

Temporal motifs are defined as equivalence classes of 
connected event sets [25]. To explain this in more de- 
tail, consider a temporal network Gt — {V, E) where 
the events E represent interactions between the nodes 
V (Figure [T|^). An event = {vifl,Vi^i,ti,6i) £ E from 
node Vifi £V to Vi^i G V starts at time ti and has dura- 
tion 5i [42] . In this article we also presume that the tem- 
poral network is colored: there is a mapping p : V ^ C 
from nodes to the set of possible colors C. Colors can be 
used to distinguish different node types. 

Given a time window Af, two events are At- adjacent 
if they share at least one node and the time difference 
between them is no longer than At. With adjacency we 
can define connectivity: two events are At- connected if 
there exists a sequence of At-adjacent events between 
them. A temporal subgraph can now be defined as a set 
of events where any two events are At-connected. If the 
events in this set are also consecutive for each node, the 
temporal subgraph is valid [43 . 

Finally, a temporal motif m is an equivalence class of 
valid temporal subgraphs; two subgraphs are considered 
equivalent if their underlying colored graphs are isomor- 
phic and their events occur in the same order (Figure 
[1J3). Given a temporal network, motif count C{m) is the 
number of valid temporal subgraphs in equivalence class 
m; the algorithms given in |22j allow calculating C(m) 
for small motifs in colored temporal networks with up to 
10^ events. From now on we will simply use the term 
motif to refer to colored temporal motifs. 



Appendix [P] we describe how to construct a null model 
to calculate reference count C{m) that corresponds to 
the count of motif m under the null hypothesis that motif 
counts do not depend on node types, given the structure 
of the weighted aggregate network. Conditioning on the 
aggregate network is crucial: it means that any difference 
observed between C{m) and C{m) cannot be explained 
by differences in the number of nodes of each type, activ- 
ity of node types, or preferred connectivity patterns. The 
results obtained by comparing against this null model are 
purely temporal: they are independent of anything ob- 
servable in the aggregate network. 

A. Synthetic data 

To illustrate what this null model can reveal we first 
apply it to synthetic data where we know exactly what 
there is to find. The synthetic data has two types of 
nodes, red and blue, with events occurring in such a fash- 
ion that those causal chains are more common where the 
first event takes place between nodes of the same color 
(see Appendix [C] for details). The weighted aggregate 
network, however, has no structure. In particular, it is 
impossible to discern any difference between red and blue 
nodes in the aggregate network. 

In the following we analyze the temporal patterns us- 
ing all two-event motifs (Figure [Tp). To identify whether 
the null hypothesis is true — that there are no differences 
between node types given the aggregate network — we cal- 
culate the 2-score 
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a{C{m)) 

where fi{C{m)) and a{C{m)) are the mean and standard 
deviation of the count in the null model. If the null hy- 
pothesis is true, z-scores are expected to have zero mean 
and unit variance. As expected. Figure [2] shows that for 
the synthetic data this is not the case. 

While the z-score is useful for evaluating whether the 
null hypothesis is true or not, it does not directly reveal 
effect size. Instead of z-score, we present results using 
the ratio r(m) = C [m) / n,{C {m)) . Because C{m) is the 
motif count under the null hypothesis that node types 
have no effect, the ratio r(m) reveals how much more 
or less common a motif is because of node types. In 
the synthetic data the overrepresented causal motifs have 
r(m) « 1.18, that is, they are 18 % more common than 
expected if there were no difference between node types. 



II. NULL MODEL FOR DIFFERENCES 
BETWEEN NODE TYPES 

Just as with static motifs, the motif counts alone are 
not very informative: with nothing to compare with it is 
difhcult to say whether a given count is high or low. In 



III. RESULTS 

We now turn to study a mobile phone data set that 
contains 600 million calls during a period of 6 months be- 
tween 6.3 million anonymized customers (see Appendix 
fAj). As the data includes information on the time and 
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FIG. 1. (A) A schematic presentation of two temporal networks. The top one has clear temporal structure, while the bottom one 
is random; however, both give rise to the same aggregate network. (B) Two examples on identifying temporal motifs. Starting 
from a temporal network (left) we first identify temporal subgraphs (middle) and then the temporal motif corresponding to 
each subgraph (right). Note that temporal motifs do not contain information about the identities of nodes or the exact times of 
events, but do retain information about node colors and the temporal order of events. (C) The six possible uncolored two-event 
temporal motifs whose colored variants are used in our analysis. 
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FIG. 2. The distribution of z-scores for all two-event motifs 
in the synthetic data, averaged over 50 data sets. For most 
motifs 2 ~ because there are no differences between node 
colors. The peak at z « 17 corresponds to the four causal 
chains where the first event occurs between nodes of the same 
color. Because expected motif count is defined by the aver- 
age count of uncolored motifs, the peak at z ~ — 17 contains 
the four remaining causal chains where the first event occurs 
between nodes of different color. 



by combining gender, age group, and payment type. 



A. Node types affect motif counts 

We first check whether the null hypothesis is true or 
false — whether node types affect motifs counts beyond 
what can be expected based on the aggregate network — 
by plotting the distribution of z-scores, shown in Figure 
[SK- The distribution does not have zero mean or unit 
variance; instead, about 35% of motifs have \z\ > 1.96. 
The null hypothesis is clearly false, and we can conclude 
that motif counts are not independent of node types. 

For comparison, Figure[3|3 shows the same distribution 
after shuffling node types, and Figure [3p after shuffling 
event types (see Appendix |b| . In both cases the distri- 
butions suggest that the null hypothesis is true. Indeed, 
after randomizing node types there can be no differences 
between them even though the data still contains the 
same untyped temporal subgraphs as the original data. 
The time shuffled data, on the other hand, has exactly 
the same aggregate network as the original data. How- 
ever, because event times are now uncorrelated, all differ- 
ences between node types are explained by the structure 
of the aggregate network. 



duration of calls it can be represented as a temporal net- 
work. Node types are formed by combining the gender, 
age group, and payment type (prepaid or postpaid mobile 
subscription plan) of customers. Our analysis focuses on 
two-event motifs for simplicity (Figure [ip). Lar ger mo- 
tifs are not only more demanding computationally, but 
also more laborious to analyze: there are already 56448 
different two-event motifs with the 24 node types created 



B. There is temporal homophily 

Figure [3]A shows that there are differences between 
node types; we will now look at what these differences 
are. Homophily refers to the well-documented tendency 
of individuals to interact with others similar to them with 
respect to various social and demographic factors |311 - 
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FIG. 3. The distribution of z-scores for all two-event motifs with C(m) > 50 in (A) the first month of empirical data, (B) the 
same data after shuffling node types, and (C) after shuffling event times. The gray curve shows a Gaussian distribution with 
zero mean and unit variance for reference. We also show the mean ^, standard deviation a and the p- value of the Shapiro- Wilk 
test for normality. 



133] . Since social networks act as conduits of information, 
homophily limits the information that individuals can re- 
ceive. Here we investigate whether homophily also mani- 
fests at the level of contact sequences, i.e. whether there 
is temporal homophily, a tendency of similar individuals 
to jointly participate in interaction patterns beyond the 
homophily observed in the aggregate network. 

To this end we calculate the average r(m) of two-event 
motifs where all nodes have the same age group, gen- 
der, or payment type, or agree for all of these attributes. 
This average is then compared to the average r{m) of 
all other motifs. The results are presented in Table |l] 
For motifs involving only two individuals, the only sta- 
tistically significant difference is an under-expression of 
the returned call motif for participants of the same pay- 
ment type; detailed analysis (see SI) reveals that this is 
because of frequent patterns where a prepaid customer 
calls a postpaid customer, who then immediately calls 
back. 

More complex motifs — chains and stars — exhibit more 
evidence of temporal homophily. Those motifs where 
all participants are similar with respect to all three at- 
tributes are significantly more common. Star motifs are 
more common also when the participants agree with re- 
spect to only one attribute. The strongest effect is ob- 
served for similarity in payment type; it is, however, 
likely that the payment type correlates with various so- 
cioeconomic factors. Results for text messages are qual- 
itatively similar (see SI). 

C. Chains and stars are over-expressed for females 

Gender homophily is generally less strong than ho- 
mophily by age, race, or education [33]. However, gender- 
related differences in communication have been docu- 
mented at least in instant messaging [33], Facebook [55] . 
and the use of both domestic [3S] and mobile phones 



[371 [3H]- 

To analyze gender differences in temporal motifs we 
calculate the average r{m) separately for motifs where 
all participants are either male or female. The results 
are displayed in Table |Tl] No difference is observed for re- 
peated and returned calls, but for all other motifs the all- 
female case is over-expressed, and all-male case slightly 
under-expressed. While it is unrealistic to claim there 
to be a single explanation for this observation, the re- 
sult is in line with a study of French domestic phone use 
[36] where men's phone usage was identified to be more 
instrumental and women's more conversational. 



D. Local edge density correlates with temporal 
motifs 

The algorithm used for identifying temporal motifs also 
allows distinguishing between different event types [22] . 
Here we use event types to study the correlation between 
local network density and temporal patterns; this is re- 
lated to Granovetter's hypothesis [22] that states that 
in social networks there is a positive correlation between 
edge weights and local network density, where the latter 
can be measured for example by the number of triangles 
around an edge. This hypothesis has already been veri- 
fied in mobile phone call data [5]. Because our analysis 
factors out the entire structure of the weighted aggre- 
gate network, the results presented here are independent 
of this classic hypothesis. 

We use clique percolation [3^] to create a dichotomy for 
local edge density: event ei = {vi,o,Vi^i,ti,di) is a dense 
event if the edge {vi, Vj) of the aggregate network is inside 
a 4-clique community, and otherwise et is a sparse event. 
We find clear and robust differences in temporal behavior 
between dense and sparse edges, as summarized in Table 
|III[ Single-edge motifs — repeated and returned calls — are 
more common on sparse edges, while all other two-event 
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TABLE I. Temporal homophily of calls for different two-event motifs. The columns correspond to motifs where all participants 
are similar with respect to different attributes: age (A), gender (G), payment type (P), or all three (A A G A P). The first 
value in each cell is the mean r(m) for motifs where all nodes have the same attribute value (for example all have the same age 
in column A). The second value gives the mean for all other motifs. If the first value is larger than the second, the motif has 
homophily with respect to that attribute: motifs where all nodes have the same value are relatively more common than others. 
Welch's t-test was used to test for equality; bold denotes p < 0.01 and italic p < 0.05 (including a Bonferroni correction). 





A 


G 


P 


A A G A P 


Repeated call 


1.08, 


1.11 


1.12, 


1.09 


1.09, 


1.13 


1.09, 


1.11 


Returned call 


1.04, 


1.01 


1.02, 


1.01 


0.98, 


1.06 


1.02, 


1.02 


Non-causal chain 


1.05, 


1.03 


1.05, 


1.03 


1.05, 


1.01 


1.18, 


1.03 


Causal chain 


1.03, 


1.02 


1.04, 


1.02 


1.05, 


0.98 


1.16, 


1.02 


Out-star 


1.12, 


1.03 


1.06, 


1.03 


1.07, 


1.01 


1.32, 


1.04 


In- star 


1.09, 


1.04 


1.07, 


1.04 


1.03, 


1.06 


1.16, 


1.04 



TABLE II. Analysis of gender-dependent call homophily re- 
veals that all-female star and chain motifs are more common 
than respective all-male motifs. The first value in each cell 
is the average r(m) for motifs where all nodes have the same 
gender, and the second value is the average r(m) for all other 
motifs. Statistical testing was done as in Table [l] 





Female 


Male 


Repeated call 


1.11, 


1.11 


1.13, 


1.10 


Returned call 


1.02, 


1.01 


1.02, 


1.02 


Non-causal chain 


1.08, 


1.02 


1.01, 


1.04 


Causal chain 


1.08, 


1.01 


0.98, 


1.03 


Out-star 


1.10, 


1.03 


1.01, 


1.04 


In-star 


1.11, 


1.03 


1.01, 


1.05 



TABLE III. The median r{m) over all months for different 
two-event motifs when the events occur on either dense (D; 
inside a 4-clique community) or sparse (S; all other edges) 
edge. For the first two motifs both events take place on the 
same edge so they necessarily have the same type. 





D-D 


S-S 


S-D 


D-S 


Repeated calls 


0.88 


1.063 






Returned calls 


0.905 


1.052 






Non-causal chain 


1.110 


0.994 


0.890 


0.875 


Causal chain 


1.082 


1.005 


0.903 


0.892 


Out-star 


1.123 


1.015 


0.844 


0.838 


In-star 


1.121 


0.970 


0.886 


0.879 



motifs follow an opposite pattern and are relatively more 
common in dense parts of the network. One possible ex- 
planation is that sparse parts of the network offer less 
opportunities for motifs that occur on two edges. Were 
this the case, one would expect motifs with one dense 
and one sparse event to lie between the other cases; this 
is however not what we observe. The order of these four 
cases is also very robust: if we also include node types, 
the same pattern is observed for almost all combinations 
of node types (see SI). This is remarkable because each 
combination of node types essentially constitutes an in- 
dependent sample. 

Granovetter's hypothesis says that dense edges have on 
average higher weights. But in addition to having higher 
weights, we find that dense edges are more commonly 
related to group talk, temporal patterns involving more 
than two individuals. 



IV. DISCUSSION 

Human relations are inherently dynamic, and at the 
highest time resolution they manifest as sequences of 
interactions. Electronic communication records have 
proven especially useful for studying behavioral patterns 
of single individuals and relating this to the function- 



ing of the social system as a whole; one example is the 
ubiquity of burstiness in human communication |8 and 
its effect on spreading dynamics [T31 HH SOI SI] ■ In this 
article we begin to assess meso-scale temporal patterns, 
group interactions that cannot be observed in the the 
static network representation. 

The mobile phone data was found to have rich meso- 
scale temporal structure. While some results are easy to 
explain, such as the relative prevalence of repeated calls 
between prepaid and postpaid users, other equally robust 
and consistent results are less easy to account for, such 
as the correlation of recipients' age observed in out-stars 
(see SI). The connection between temporal motifs and 
local edge density was also found to be very robust — 
the same pattern was detected for most combinations of 
node types — and shows that dense and sparse edges have 
different roles in communication. Of course, event types 
can be used in similar fashion to study the correlation 
between temporal motifs and any other local network 
property. 

The framework introduced in this article is not limited 
to social systems but can be applied to various complex 
systems for which time-resolution data is available. The 
largest constraint is that the concepts introduced in [22] 
are currently applicable only to data where nodes have 
at most one event at a time, or where events have no du- 
ration. What makes this framework particularly useful 
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is the fact that any temporal differences identified are in- 
dependent of the aggregate network, and therefore com- 
plementary to any existing information on the weighted 
aggregate network. 
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Appendix A: Mobile phone data 

The data used in this article consists of six months 
of anonymized mobile phone records with a total of 625 
million calls and 207 million SMS. We divide the data into 
six consecutive months (periods of 30 days) and repeat 
the analysis separately in each period to make sure the 
results are consistent in time. The number of calls (SMS) 
in these periods ranges from 99.8 to 108.5 million (32.8 
to 37.0 million). 

Node types are based on customer meta data. Node 
type is a combination of three factors. The first two 
factors are gender and age, with age represented by six 
intervals with approximately 1 million users in each: 18- 
26, 27-32, 33-38, 39-45, 46-55, and 56-80. The third 
factor is payment type, which can be either postpaid or 
prepaid. Postpaid users are billed for past calls while pre- 
paid users pay for their calling time beforehand. Even 
though studying the effect of payment type is not our 
main interest, we include it in the node type because it 
can be expected to affect behavior — prepaid users can be 
expected to make less calls because their calls are typ- 
ically more expensive and calling time is limited — and 
because payment type is likely to correlate with various 
socioeconomic factors. 

Combining gender, payment type and age gives a to- 
tal of 2 x 2 x 6 = 24 different node types. The results 
have been calculated for the 6.22 million users with fully 
known type and with contract assigned to only one phone 
number [44j . In all calculations with the empirical data 
we use time window At = 10 minutes, which allows rea- 
sonable time for intentional reactions but should not in- 
clude too many serendipitously simultaneous events. 



Appendix B: Shuffling node types and event times 

We use two different kinds of shuffled data to illustrate 
that the null model correctly identifies a true negative 
result. The node type shuffled data is created by shuffling 
node types. That is, if q is the type (color) of node Vi in 
the empirical data, in the shuffled data this node has type 
Ccr(i) where ct is a random permutation of node indices. 

The time shuffled data is created in a similar fashion: 
if (J is a permutation of event indices, in the shuffled 
data event occurs at time ta-^i) and has duration Sc^i). 
However, because we need to enforce the constraint that 
nodes have no more than one event at a time, a stan- 
dard shuffling algorithm cannot be used. Instead we use 
a MCMC algorithm that switches the times of two ran- 
domly selected events if the switch does not result in 
some node having overlapping events. 



Appendix C: Synthetic temporal network data 

To construct the synthetic data we first create an undi- 
rected regular graph with N — 10'* nodes, each connected 
to fc = 5 random nodes, and assign node colors indepen- 
dently of network topology so that there are N/2 red and 
N/2 blue nodes. 

Events between the nodes are generated with the fol- 
lowing process. On every time step a sporadic event oc- 
curs on an edge with probability p = 0.0001. If the spo- 
radic event takes place between two nodes of the same 
color, say from itoj, then for the next 100 time steps the 
recipient j has an additional probability of p to initiate a 
triggered event towards a random neighbor other than i. 
Event durations are drawn from a geometric distribution 
with mean ^ — 10, and nodes may only participate in 
one event at a time. New events are generated from this 
process until there are on average 100 events per edge. 
Motifs are identified with At = 100. 

Note that the distinction between sporadic and trig- 
gered events is only made when generating the data; the 
final data has only one kind of events. Because the un- 
derlying network is random and regular, and because the 
occurrence of neither sporadic nor triggered events on a 
given edge depends on node colors, this process results 
in a temporal network where all edges have on average 
the same number of events. 



Appendix D: Null model for assessing the influence 
of node types 

Let Ga — {V,L) be the aggregate network, and 
let i — [(ii, ji), . . . , (i„, j„)] denote a location, an or- 
dered sequence of edges of the aggregated network where 
{ikTjk) G L Vfc. If we presume that events take place on 
these edges in the order given, there is a unique tempo- 
ral motif rrn that corresponds to location £. We take into 
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account the structure of the aggregate network by mod- 
ehng the motif count Ce{m) on £ as a random variable 
under the null hypothesis Hq that motif count at £ does 
not depend on node types. 

What can the motif count depend on if not node types? 
There are two possible factors: the weights of the edges in 
£, and the network structure outside i. We approximate 
the latter effect to be negligible: the occurrence of a motif 
on £ does not depend on events taking place on other 
edges [45] . 

Edge weights, on the other hand, are likely to correlate 
strongly with motif counts. Let w = [wi-^^j-^^, . . . ,Wi^j^] 
denote a sequence of edge weights in the aggregate net- 
work and the weight sequence of the edges in £. As- 
suming Hq is true and given the above approximation, 
Ci{m) is independent of node types and depends only on 
Wf. We thus write Ce{m) ^ P{m*^, w^) where to* is mo- 
tif TO without node types — in other words, Ci{m) follows 
a distribution parametrized by to| and w^. The distribu- 
tions P{m* , w) are estimated from data. By summing 
over all locations for which mi = m we obtain the total 
motif count under the null hypothesis: 

C{m) = ^dm) . 

ilmi—m 

To see that C(to) is an unbiased estimate when Hq is 



true we write its expected value as 

E[Cim)]= J2 E[Ceim)]=J2 E ^[^KH] 
= E = TO, = w}| • E[Ci{m) I Wf = w] 

w 

On the other hand, the empirical motif count can be 
written as 

£\mi—m w £\mi—m,wi—w 

= ^ \{£\mi ^ m,Wi = w}| • Cw(to) 

w 

where C^{m) is the average count of motif to at loca- 
tions with weight sequence w. Now E\C{m)\ = C{m) 
if E[Ci{m) I = w] = C^[m) Vw, which is indeed the 
case when Hq is true. In the SI we present an algorithm 
for generating samples of C (to) . 

Note that because the distributions P{m*,w) are es- 
timated from the data, the null model is not influenced 
by high motif counts on rare high weight edges: if there 
is only one location corresponding to weight sequence w, 
the distribution P(to*,w) is a delta function and sam- 
pling will always produce C^(to) = Ci{m). 
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SUPPLEMENTARY INFORMATION 

Generating samples from the null model 

Algorithm [T] can be used to generate samples from the 
null model described in Materials and Methods. The in- 
put consists of the temporal network (event set E), the 
maximum number of events in temporal motifs (nmax)j 
number of independent samples to generate (A'sampics), 
and the time window At used to identify temporal mo- 
tifs. The algorithm outputs motif counts in the empirical 
data (C) and the motif counts sampled from null model 
(C), corresponding to the count of motif m under the 
null hypothesis that motif counts do not depend on node 
types, given the structure of the weighted aggregate net- 
work. 

The algorithm consists of three parts. On lines 2-8 
we identify all temporal motifs in the empirical data and 
count the occurrence of motifs by location. This results in 
two data structures: M[£] gives the number of temporal 
motifs at location £ (note that the corresponding motif 
mi is uniquely defined by i), and C[m\ gives the total 
count of motif m in the empirical data. 

On lines 9-18 these two data structures are used to con- 
struct the distributions P{m*,w) for all combinations of 
the untyped motif m* and topology w. The result is a 
data structure P such that P[(m*, w)][C] is the number 
of locations with weight sequence w and topology defined 
by m* that have exactly C occurrences of the motif m* . 
Note that node colors are not used during this step; the 
distributions are constructed under the null hypothesis 
that node colors have no effect, in which case the distri- 
butions would be identical for different node colors. 

On lines 20-26 we draw samples from P(m*,w). We 
again go through all locations, but this time take into 
account the node colors. Because the final sample C(m) 
is a sum over only those locations that have correct node 
colors, the structure of the aggregate network is taken 
into account. Note that it is relatively cheap to gener- 
ate multiple samples for C{m) (lines 24-26); the only 
thing that needs to be repeated is drawing samples from 
P(m*,w). 



Homophily and gender differences for calls and SMS 



ALGORITHM 1. Algorithm for generating samples from 
the null model. E is the event set that defines the temporal 
network, rimax is the maximum number of events in temporal 
motifs to study, and Nsampies is the number of independent 
samples to generate for each C(m). The function ValidSub- 
GRAPHS goes through all valid temporal subgraphs with at 
most rimax events. The function Locations goes through all 
locations — ordered sequences of edges such that the underly- 
ing graph formed by these edges is connected — - in the aggre- 
gate network G that have at most rimax edges. The function 
returns both the empirical motif counts C and the sampled 
counts C. 

1: function SAMPLENULLMODEL(iJ, rlmax, Nsampies, At) 



2: Initialize M as Map(^ N). 

3: Initialize C as Map(m —5> N). 

4: for Gt in ValidSubgraphs(£, rimax. At) do 

5: Let £ be the location of Gt- 

6: Let m be the colored motif corresponding to Gt- 

7: Increment M[£]- 

8: Increment C[m]. 

9: Construct weighted aggregate network G from E. 

10: Initialize P as Map((m*,w) Map(N N)) 

11: for £ in Locations(G, rimax) do 
12: Let nil be the uncolored motif defined by £- 

13: Let W£ be the weight sequence at £ in G. 

14: if ^ e M then 

15: Let Ce = M[£] 

16: else 
17: Let Ce ^0 

18: Increment P[(m|, Wf)] [Gf]. 

19: Initialize G as Map(m — > List) 

20: for £ in LoCATlONS(G, n,nax) do 
21: Let me be the colored motif defined by £. 

22: Let rrie be the uncolored motif defined by £- 

23: Let We be the weight sequence at £ in G. 

24: for Z in 1 . . . Nsampies do 

25: Let Ce be a sample from P[(m|,Wf)] 

26: Add Gf to C[me][i]- 

27: return G,G 



(either 18-26, 27-32, 33-38, 39-45, 46-55, or 56-80). 

We will now discuss in more detail the results on differ- 
ent motif topologies. The discussion is based on results 
presented in Tables [SLTHSLI] and Figures [SLTHSLT21 



Tables [SLT] through |SI.4| show results on temporal ho- 
mophily for both calls and SMS. Parts of Tabl e [STT] 
and |SI.3| are presented in the main text. Figures |SI.1| - 
SI.6 show the most common 2-event motifs of each type 
for calls, and Figures [SLTHSTd^ for SMS. The motifs 
have been sorted according to median r(m) over all six 
months, and this value is shown below each motif. Open 
nodes correspond to postpaid customers, filled nodes to 
prepaid. Gender is denoted by node color: red for female, 
blue for male. Age is shown inside each node, and the 
number corresponds to the beginning of the age interval 



Repeated contacts 



Table [SL3| reveals that repeated calls are significantly 
more common between prepaid users. Figure [SI. 1| shows 
also that if a postpaid user is involved, the receiver is typ- 
ically still prepaid; the most common case where both are 
postpaid has r = 1.083. There is very little homophily 
with respect to gender or age. SMS shown in Figure [SLT] 
have a similar pattern, but the effect is much weaker. 
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TABLE SI.l. Temporal homophily of calls for different motifs. The columns correspond to different attributes: Age, Gender 
and Payment type. The first value in each cell is the mean r(m) for motifs where all nodes have the same attribute value 
(for example all have the same age in column A). The second value gives the mean for all other motifs. If the first value is 
larger than the second, the motif has homophily with respect to those attributes: cases where all nodes have the same value 
are relatively more common than others. Welch's t-test was used to test for equality; bold denotes p < 0.01 and italic p < 0.05 
(including a Bonferroni correction corresponding to the number of tests in this table). 





A 


G 


P 


A A G 


A A P 


GAP 


A A G A P 


Repeated contact 


1.08, 1.11 


1.12, 


1.09 


1.09, 1.13 


1.12, 1.11 


1.05, 


1.11 


1.11, 


1.11 


1.09, 


1.11 


Returned contact 


1.04, 1.01 


1.02, 


1.01 


0.98, 1.06 


1.06, 1.01 


1.00, 


1.02 


0.98, 


1.03 


1.02, 


1.02 


Non-causal chain 


1.05, 1.03 


1.05, 


1.03 


1.05, 1.01 


1.11, 1.03 


1.08, 


1.03 


1.06, 


1.03 


1.18, 


1.03 


Causal chain 


1.03, 1.02 


1.04, 


1.02 


1.05, 0.98 


1.08, 1.02 


1.07, 


1.02 


1.07, 


1.01 


1.16, 


1.02 


Out-star 


1.12, 1.03 


1.06, 


1.03 


1.07, 1.01 


1.22, 1.04 


1.18, 


1.04 


1.09, 


1.03 


1.32, 


1.04 


In-star 


1.09, 1.04 


1.07, 


1.04 


1.03, 1.06 


1.13, 1.04 


1.08, 


1.04 


1.06, 


1.04 


1.16, 


1.04 




TABLE SI.2 


. Homophily of SMS for different motifs. 












A 


G 


P 


A A G 


A A P 


GAP 


A A G A P 


Repeated contact 


1.03, 1.02 


1.03, 


1.02 


0.99, 1.06 


1.04, 1.02 


0.99, 


1.03 


1.00, 


1.03 


0.99, 


1.03 


Returned contact 


0.99, 1.02 


1.03, 


1.00 


1.00, 1.03 


1.00, 1.02 


0.98, 


1.02 


1.02, 


1.01 


0.99, 


1.02 


Non-causal chain 


1.02, 0.97 


1.09, 


0.95 


0.97, 1.02 


1.16, 0.97 


1.04, 


0.97 


1.09, 


0.96 


1.24, 


0.97 


Causal chain 


0.99, 0.98 


1.05, 


0.97 


0.95, 1.05 


1.09, 0.98 


1.00, 


0.98 


1.04, 


0.97 


1.11, 


0.98 


Out-star 


1.10, 1.00 


1.17, 


0.95 


1.04, 0.98 


1.35, 1.00 


1.17, 


1.00 


1.21, 


0.97 


1.49, 


1.01 


In-star 


1.02, 1.00 


1.13, 


0.97 


1.02, 0.98 


1.18, 0.99 


1.06, 


0.99 


1.16, 


0.97 


1.33, 


0.99 



TABLE SI. 3. Homophily for calls when either gender or pay- 
ment type is fixed. The first value is average r(m) for motifs 
where all nodes have the same value of gender (either Female 
or Male), or the same value of payment type (Postpaid or 
Prepaid). The second value is average r(m) for all other mo- 
tifs. 

G=Ma 



G=Fe 

Repeated contact 1.11, 1.11 

Returned contact 1.02, 1.01 

Non-causal chain 1.08, 1.02 1.01, 1.04 
Causal chain 
Out-star 
In-star 



P=Po P=Pr 
1.13, 1.10 0.90, 1.18 1.27, 1.05 
1.02, 1.02 1.00, 1.02 0.95, 1.04 
1.06, 1.00 0.96, 1.04 
1.08, 1.01 0.98, 1.03 1.06, 0.97 0.92, 1.03 

1.10, 1.03 1.01, 1.04 1.05, 1.03 1.11, 1.03 

1.11, 1.03 1.01, 1.05 1.03, 1.06 1.03, 1.05 



TABLE SI. 4. Homophily for SMS when either gender or pay- 
ment type is fixed. 





C= 


Fe 


C= 


Ma 


P= 


Po 


p= 


Pr 


Repeated contact 


1.02, 


1.03 


1.04, 


1.02 


0.98, 


1.04 


1.01, 


1.03 


Returned contact 


1.01, 


1.02 


1.05, 


1.00 


0.97, 


1.03 


1.04, 


1.01 


Non-causal chain 


1.08, 


0.96 


1.09, 


0.97 


0.91, 


1.03 


1.04, 


0.96 


Causal chain 


1.04, 


0.98 


1.06, 


0.98 


0.89, 


1.05 


1.05, 


0.96 


Out-star 


1.18, 


0.98 


1.15, 


1.00 


1.05, 


0.99 


1.01, 


1.01 


In-star 


1.09, 


0.99 


1.19, 


0.99 


1.02, 


1.00 


1.01, 


1.00 



Returned contacts 



Figure |SI.2| reveals a particularly strong regularity for 
returned calls: the most common motifs are those in 
which the first caller is prepaid and the second is post- 



paid. This shows up as heterophily by payment type 
in Table |SI.1| A plausible — if not exciting — explanation 
is that prepaid plans are generally more expensive and 
calling time is limited. 



Causal and non-causal chains 



The most surprising thing about the results on non- 
causal and causal chains (Figures SI.3 and SI.4) is that 
they are so similar. Both are more common for postpaid 
users and females. As shown in Table [SL2j for SMS there 
is homophily by gender but in-homophily by payment 
type. 



Out-star 



Out-stars for calls shown in Figure SI.5 have the high- 
est ratio scores of all motifs. There is strong homophily 
with respect to all attributes. All- female out-stars are 
common for both calls and SMS, while only the SMS 
out-star is overrepresented for males.. Figure [SL5| also re- 
veals a surprising homophily with respect to age: those 
cases where the two receives have similar age are most 
common. 
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In-star 



The exact opposite is true for all motifs that take place 



Figure SI. 6 shows that for calls in-stars are more com- 
mon when the receiver is prepaid and the two callers 
postpaid. There is some homophily by the age of the two 
callers, but this does not appear to be as strong as for 
out-stars. 



Local edge density and temporal motifs 



Figures |SI.13| through |SI.18| show the most common 
2-event motifs when events are separated into two cate- 
gories by local density. The green edges denote events on 
dense edges (inside a 4-clique community) and red edges 
all other events. Results have been calculated separately 
for six consecutive months, and the ratio shown below 
each motif is the median over all months. The node type 
is a combination of gender (red for female, blue for male) 
and payment type (open for postpaid, filled for prepaid); 
age categories are not used in order to limit the num- 
ber of different motifs. Because of the null hypothesis 
used, comparison of r{m) is valid only between motifs 
that have the same node types and differ only in event 
types. 

The results for different motif types are surprisingly 
consistent. For the two motifs with only one edge, re- 
peated calls in Figure |SI.13| and returned calls in Figure 
|SI.14[ the motifs with sparse edge are more common. 



on two edges (Figures SI.15 SI.18 1: these motifs are more 



common on dense edges. 

As discussed in main text, one possible explanation 
is that the local network density directly affects motif 
counts. After all, in order to construct the null model we 
approximate that the number of motifs at any location 
does not depend on neighboring edges. However, if this 
were true, we would expect those motifs that have one 
dense and one sparse event to fall between the two ex- 
tremes; this is however not the case. For all motifs with 
two edges, the most common motifs are those where both 
events are dense, the second those where both are sparse, 
and the least common are those where one edge is dense 
and the other is sparse. 

Note that these results are to a large extent indepen- 
dent from those shown in Figures |SI.l| |SI.r2| that only 
consider node types. For example, we saw above that 
returned contact is more common when the first caller is 
prepaid and the second postpaid. Looking at this pattern 
in Figure |SI.14| for example between prepaid female and 
postpaid female user, we have r(m) = 1.048 for sparse 
edges (1st row, 3rd from right) and r{m) — 0.898 for 
dense edges (3rd row, at right). Combining this result 
with those presented earlier, we find that the returned 
contact between prepaid female and postpaid female is 
more common than with other combinations of node 



types (Figure SL2), and more common on sparse than 



dense edges (Figure SI. 14 1 
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FIG. SI.l. Most common repeated contact motifs for calls ordered by r(m). 
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FIG. SI. 2. Most common returned contact motifs for calls ordered by r(m). 
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FIG. SI. 3. Most common non-causal chain motifs for calls ordered by r(m). 
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FIG. SI. 4. Most common causal chain motifs for calls ordered by r{m). 
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FIG. SI. 5. Most common out-star motifs for calls ordered by r{m). 
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FIG. SI. 6. Most common in-star motifs for calls ordered by r{m). 
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FIG. SI. 7. Most common 2-event burst motifs for SMS ordered by r(m). 
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FIG. SI. 8. Most common call-back motifs for SMS ordered by r(m). 
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FIG. SI. 9. Most common non-causal chain motifs for SMS ordered by r(m). 
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FIG. SI. 10. Most common causal chain motifs for SMS ordered by r(m). 
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FIG. SI. 11. Most common out-star motifs for SMS ordered by r(m). 
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FIG. SI. 12. Most common in-star motifs for SMS ordered by r(m). 
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FIG. SI. 13. Most common repeated contact motifs for calls ordered by r(m). Green events lie inside 4-clique communities, red 
corresponds to all other cases. Solid nodes are prepaid users, open nodes postpaid; blue nodes are male, red are female. Note 
that because of the null hypothesis used it only makes sense to compare the r(m) score between motifs that have the same 
node types and differ only in the event type. 
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FIG. SI. 14. Most common return contact motifs for calls ordered by r{m). 
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FIG. SI. 15. Most common non-causal chain motifs for calls ordered by r(m). 
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FIG. SI. 16. Most common causal chain motifs for calls ordered by r(m). 



27 




1.185 1.182 1.175 1.175 1.172 1.164 1.160 1.155 




1.151 1.150 1.146 1.135 1.133 1.131 1.130 1.129 




1.127 1.125 1.123 1.122 1.121 1.120 1.118 1.114 




1.112 1.110 1.109 1.108 1.107 1.103 1.102 1.102 




1.101 1.100 1.099 1.098 1.096 1.096 1.096 1.095 




1.095 1.093 1.093 1.091 1.090 1.089 1.088 1.086 




1.085 1.083 1.082 1.081 1.080 1.079 1.078 1.078 




1.077 1.076 1.072 1.071 1.070 1.070 1.068 1.065 




1.048 1.037 1.036 1.036 1.032 1.032 1.032 1.031 




1.031 1.031 1.030 1.029 1.026 1.025 1.025 1.024 



FIG. SI. 17. Most common out-star motifs for calls ordered by r(m). 
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FIG. SI. 18. Most common in-star motifs for calls ordered by r(m). 



